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ABSTRACT 

Power-law tails are often seen in probability density functions (PDFs) of molecular 
cloud column densities, and have been attributed to the effect of gravity. We show that 
extinction PDFs of a sample of five molecular clouds obtained at a few tenths of a par¬ 
sec resolution, probing extinctions up to Ay ~ 10 magnitudes, are very well described 
by lognormal functions provided that the field selection is tightly constrained to the 
cold, molecular zone and that noise and foreground contamination are appropriately 
accounted for. In general, field selections that incorporate warm, diffuse material in 
addition to the cold, molecular material will display apparent core+tail PDFs. The 
apparent tail, however, is best understood as the high extinction part of a lognormal 
PDF arising from the cold, molecular part of the cloud. We also describe the effects 
of noise and foreground/background contamination on the PDF structure, and show 
that these can, if not appropriately accounted for, induce spurious tails or amplify any 
that are truly present. 
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1 INTRODUCTION 

There has been a great deal of interest lately in the column 
density probability density functions (PDFs) of molecular 
clouds. The PDFs appear lognormal in form, but often with 
~power-law tails in the extreme positive wing. This has been 
linked to the role of gravity in star-forming regions (Kain- 
ulainen et al 2009; Froebrich &; Rowles 2010; Russeil et al 
2013; Alves de Oliveira et al 2014; Schneider et al 2014), as 
corresponding tails are seen in density PDFs derived from 
numerical simulations of isothermal clouds in 3D (Klessen 
2001; Cho & Kim 2011; Girichidis et al 2014). 

In this paper we show that fitting the column density 
PDF in the direct space representation, where the role of 
noise is more straightforwardly accounted for, is more re¬ 
liable than the methods of fitting in log space that have 
generally been used to-date. We show that the amplitude 
(or existence) of power-law tails in column density PDFs 
can be significantly affected by noise, and the presence of 
an unaccounted-for foreground/background contribution to 
column density. 

We also investigate the role of field selection , paying at¬ 
tention to the relative amounts of cold molecular gas and 
warm diffuse (atomic and molecular) gas in the field. The 
extinction-based molecular cloud column density fields ana- 
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lyzed below can be fitted with a single lognormal function, 
provided that the field is restricted to the cold molecular 
parts of the extinction and that noise is correctly accounted 
for in the fitting. Our results apply to PDFs sampled at a 
few tenths of a parsec resolution, comprising extinctions up 
to - 10 magnitudes. 

The layout of the paper is as follows. In Section 2, we 
describe the basic idea behind the fitting method. Section 3 
and Section 4 describe fitting PDFs with the standard log- 
space method and our direct space method, respectively. In 
Section 5, the systematics of noise and contamination are 
described. Section 6 documents the effect of field selection 
on the derived PDFs, followed by a Discussion in Section 7. 
Section 8 includes a brief aside on the scale- and resolution- 
dependence of derived PDF dispersions. Our Summary is 
given in Section 9. 


2 LOGNORMAL PDFS IN THE PRESENCE 
OF NOISE AND BACKGROUND 
UNCERTAINTY 

We will start with a baseline assumption that a column den¬ 
sity field can be described by a single lognormal function, 
and evaluate how well this represents the data for a sam¬ 
ple of column density fields determined through extinction 
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Figure 1. Fits (red) to the core of the log space PDFs (black) for the three fields. The 


fitting range is indicated by the dashed lines. 



Figure 2. Direct fits (blue) to the column density PDFs (black), including noise PDF convolution. The direct space representation of 
the fits from Figure^ are shown as red lines. 


mapping. Variations on this scheme will be presented sub¬ 
sequently. 

If the true column density PDF, f(N ), is lognormal 
then In ( N ) is normally distributed. Let In (iV) have a mean 
H and variance a 2 . Then the mean and variance of N are, 
respectively: 

Un = {N) = (1) 


a 2 N = (N 2 ) - (N) 2 = (e* 2 - l)e 2 '* +<r2 = (e ff2 - l)/&. (2) 


The assumed lognormal iV-PDF is given by: 
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If the column density field is observed in the pres¬ 
ence of Gaussian noise with mean zero and variance o^ oise , 
then assuming the noises and column densities are indepen¬ 
dent, then the expected observed mean, fiN,obs and variance, 
cT%,obsi are, respectively: 


lAN,obs — {Nobs) = I^N, 


(4) 
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From equations 0 and 0 we can obtain a noise-corrected 
estimate of the mean and variance of the noiseless iV-field if 
Onoise is known (these equations are generally true for inde¬ 
pendent signal and noise, not just in the case of a lognormal 
PDF). The log-space variance and mean are estimated, re¬ 
spectively, by: 


( 6 ) 
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where 


flN — I^N,obs, 

(8) 
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More generally, if each N is uncertain by a noise contri¬ 
bution, then the observed PDF, fobs, is just the true PDF, 
/, convolved with the noise PDF, fnoise- 


fobs — f G fnoise- (10) 

For extinction mapping, the noise variance may not be 
well-represented by a single value at all extinction levels, 
but instead is likely to rise with the extinction level. More 
elaborate schemes could take this into account, but here 
we will use a single value to represent the noise variance, 
noting that its ^constant value at low extinctions is most 
important. 

If the observed iV-field contains a contribution from un¬ 
related background (and/or foreground), we can consider 
the effect of this in simple situations. For a constant fore¬ 
ground/background contribution, Nfb = the observed 
mean is just biased high: 

HN,obs = I^N + Hfb- (11) 

The foreground/background may not be exactly constant, 
but it will likely be significantly less variable than the cloud 
of interest. Fitting to the iV-PDF could include a prelim¬ 
inary subtraction of fifb (allowed a free parameter in the 
fit). 


o 2 — ln(l + o%/ 
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Field 

a eff (mag.) 

Vfb (mag.) 

<r (log core fit) 

fi (log core fit) 

<j (direct fit) 

fi (direct fit) 

Taurus 

0.21 

0.00 

0.69 

-0.16 

0.81 

-0.25 

Perseus 

0.21 

0.07 

0.65 

-0.28 

0.89 

-0.546 

Rho Oph 

0.21 

0.54 

0.49 

+0.26 

0.97 

-0.17 


Table 1. Fitted parameters to the fields. Direct fits are performed after subtraction of /ifb- 


We can absorb some variability in Nfb into the noise 
variance, so that the effective noise variance is: 

2 _ 2.2 /-| 

®eff & Nfb T O no i sei 

though this won’t be a good representation of gradients in 
the foreground/background. For now we just allow varia¬ 
tions in Nfb to be Gaussian around the mean /ifb. In princi¬ 
ple, Nfb could also be lognormally distributed (for example) 
but for low enough variance around the mean, it will appear 
normal. Below, we allow crlff to vary around the nominal 
noise level to see if it improves the fit. 


3 FITTING TO THE PDF VERSUS In (AT) 

For reference, we perform fits on extinction maps using the 
widely-used procedure of fitting cores to the ln(iV)-PDFs 
over a restricted range of ln(iV) where the PDF appears 
gaussian (i.e. lognormal in N). All fits here are done using 
Ay in place of N. We have fitted three fields: Taurus and 
Perseus, taken from Rowles & Froebrich (2009) and Rho 
Oph, taken from the COMPLETE database (Ridge et al 2006). 
The field selection for these clouds was initially done by eye 
for Taurus and Perseus - we will describe in more detail the 
effects of field selection in Section 6. For Rho Oph, we just 
used the full field made available in the COMPLETE database, 
which is closely-concentrated on the molecular cloud. 

Figure [l] shows the PDFs of ln(Ay), along with a fit to 
the presumed core region, made between the vertical dashed 
lines. Excess ~ power-law tails of varying amplitude, rela¬ 
tive to the fitted lognormals, are seen (c.f. Kainulainen et al 
2009). In the fits above, we have not made a correction for 
the presence of foreground/background contamination (e.g. 
Schneider et al 2014, arXiv 1403.2996 submitted). These 
are negligibly small for Taurus and Perseus anyway. Nei¬ 
ther have we attempted to optimize the fitting range; this 
does not affect the major conclusions of the paper. 


4 DIRECT FITTING TO N WITH NOISE PDF 
CONVOLUTION 


Again using Ay in place of V, we take the observed maps, 
Ay )0 bs, and varying fifb and calculate: 

11A V = (Ay ;0 bs) — fifb (13) 

and 


2 _ 2 2 
^V ^Av,obs ^ e f f ' 


(14) 


We then create a lognormal PDF and convolve it with a noise 
PDF of variance c^//, looking for the values of fifb and cr\ff 
that provide the best fit to the observed PDF. Note that 
once fiA w and a\ v are calculated, no further optimisation 
of the fit by varying these estimates is performed, i.e. (for 
these initial fits) equations (13 14) hold exactly. 


We find that cr e // = 0.21 mag. provides the best fit for 
all fields: this is comparable to the noise estimates given in 
the data sources, and comparable to direct estimates from 
the fields themselves in “uninteresting” regions. Note that 
alff does not include calibration uncertainties, but is just 
an estimate of the noise variation in the maps. 

Little or no foreground/background correction is nec¬ 
essary for the Rowles & Froebrich maps as reference fields 
are used in the construction of these maps and they lie off 
the Galactic Plane near the Galactic anti-centre, but we 
find that the complete Rho Oph map is best fitted with 
a constant foreground/background level of fifb — 0.54 mag. 
(which is small compared to the Av 5 obs in the main Rho Oph 
cloud). No reference-level subtraction is reported in Ridge 
et al (2006) but their extinction map is calibrated to IRAS- 
derived 100 /im optical depth and therefore will necessar¬ 
ily include contributions from the full dust columns along 
the line-of-sight. Our estimated foregound/background of 
0.54 magnitudes implies an E(B-V) ps 0.17 (R = 3.1). This 
is consistent with the Schlegel, Finkbeiner & Davis (SFD; 
1998) E(B-V) map on the periphery of the Rho Oph field 
which has E(B-V) values in relatively unstructured regions 
towards higher/lower Galactic latitude of ~ 0.1 and 0.25 
respectively. The foreground/background across Rho Oph 
is therefore not constant but contributes an additional dis¬ 
persion above the Ridge et al (2006) nominal dispersion of 
0.16 magnitudes (a dispersion of 0.21 magnitudes is mea¬ 
sured directly in the field). If normally-distributed, the fore¬ 
ground/background fluctuations should have a dispersion in 
E(B-V) of ~ 0.044, which is comparable to that measured 
directly in the SFD map over scales comparable to the Rho 
Oph cloud (a dispersion of ~ 0.055 is measured directly; 
accounting for a 16% internal calibration dispersion, this 
predicts a natural dispersion of 0.048 for a mean E(B-V) 
level of 0.17). 

The resulting PDF estimates, after convolution with the 
noise PDF are shown in Figure [2] We also convert the gaus¬ 
sian fits to the ln(Ay) core into the direct Ay representation 
and show these for comparison. All fitted parameters are 
listed in Table [I] along with the log core fits of the previous 
Section. 

We convert the noise-convolved PDFs of Ay into the log 
representation for inspection. These are shown in Figure [3] 
and emphasize the excellence of the fit, both in the high Ay 
tail and the noise-dominated low Ay tail. (We will say more 
quantitatively about how good the fits are in the following 
Sections.) The appearance of a lognormal PDF with noise 
and/or offset is no longer symmetric about its maximum in 
the log representation; i.e. it is no longer a pure parabola. 
Counterintuitively, the width of the log space PDF near its 
peak is in fact narrowed by the noise convolution (see next 
Section), and the core fits to this region in the log represen¬ 
tation are not reliable without taking into account the noise 
smearing. 
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Figure 3. Log space representation of the direct lognormal fits (blue) from Figure [ 2 ] compared to the log space PDFs (black). The core 
fits are shown in red. On the Taurus and Perseus plots, a dashed line of slope logio(e) is shown (see text). 
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Figure 4. Direct space fits (blue lines) to the Taurus PDF with added noise (black lines). The best fit lognormal cr is quoted in each 
panel. 
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Figure 5. Log space lognormal PDFs with noise added (blue lines) compared to the noise-free fitted PDF of Taurus (black lines). 
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Figure 6. Effect of an unaccounted-for foreground/background level (l^fb)- The noise-convolved lognormal fit to Taurus, subjected to 
different [ifi values is shown as black lines. Fits to the core are shown as red lines. 
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In the log representation, the asymptotic slope of the 
negative PDF tail can be estimated, as long as the ampli¬ 
tude of the PDF is significant near Ay ~ 0 due to noise. 
Note that, as ln(Ay) becomes more negative, equal inter¬ 
vals in ln(Ay) correspond to progressively smaller intervals 
in Ay as it approaches zero. In the presence of noise, if 
we approximate the observed PDF as /(Ay)~constant over 
these small intervals, this predicts: 

/(ln(Ay)) = °c A v oc exp(ln(Av)) (15) 

in the negative log-space tail. In a plot of logio(f(ln(Ay))) 
versus ln(Ay) (Figure |3| the aysmoptotic slope is predicted 
to be logio(e) ~ 0.43. A dashed line with slope logio(e) is 
shown in the Taurus and Perseus panels, and agrees well 
with the observed slopes. For Rho Oph, the presence of a 
significant foreground has resulted in a low observed PDF 
amplitude near Ay « 0 so that the above approximation is 
inapplicable. A low PDF amplitude near Ay « 0 may also 
occur if the PDF is sampled only towards high column den¬ 
sity regions (see e.g. Section 6) so some care in interpreting 
the asymptote is needed. 

The moment-based fitting method has been also used 
recently by Butler, Tan, & Kainulainen (2014) and Kainu- 
lainen & Tan (2013). Butler et al (2014) do not find evi¬ 
dence for a power-law tail in their data, while Kainulainen 
& Tan (2013) show that the PDFs of their analyzed Infrared 
Dark Clouds, as well as Rho Oph, are consistent with sin¬ 
gle (broad) lognormal PDFs. In fact, in their re-analysis of 
the Kainulainen et al (2009) extinction data, Kainulainen & 
Tan (2013) present noise-corrected values for ctn/Hn, which 
lead to estimates for a of 0.77, 0.83, and 0.89 in Taurus, 
Perseus and Rho Oph respectively; these are similar to our 
values (0.81, 0.89, and 0.97) reported in Table [l] though a 
little lower despite higher resolution. This may be due to a 
slightly higher foreground/background contribution in their 
data. The inferred a from Kainulainen & Tan (2013) from 
moment analysis exceed the core-fitted values in Kainulainen 
et al (2009), by factors ~2. In general however, there is no 
definitive value for a that can be extracted from extinction 
maps, as we demonstrate below (Section 8). 

5 UNCERTAINTIES AND EXCESSES 
5.1 Effects of noise 

The method above is stable against noise. Negative column 
densities caused by noise pose no problem: they need not be 
excluded from the fit, and they contribute to constraining 
the low end of the PDF. In the log representation, very low 
and negative noise values are obviously problematic. Schnei¬ 
der et al (2014) have outlined a very careful scheme for deal¬ 
ing with noise and foreground/background contamination in 
log space, which relies on empirical corrections based on the 
shape and amplitude of the low Ay tail in log space. From 
the direct space perspective, this is accounted for in the fit¬ 
ting directly, using all the data. 

If the noise is Gaussian, then the leading uncertain¬ 
ties due to noise are the uncertainties in the noise’s zero 
level (±cr e /f /\J N P i X for N P i X pixels in the map) and the 
noise variance (Tv^c^/// y/N P i X — 1). For a large enough 
map, these uncertainties are negligible. For large N pix 


and Gaussian noise, the uncertainty on the lognormal a 

is /N pix )P/HA v + V^AvP 2 - This is vef y Sma11 

(~ 10 -3 ) for the maps above. 

To show the effect of noise, we added Gaussian noise 
of standard deviation a a dd — 0.5, 1.0, and 2.0 magnitudes 
to the Taurus map. Figure [4] shows fits obtained in direct 
space under the assumption that cr e // is known. The uncer¬ 
tainty in the lognormal a rises to ~ ±0.01 for a a dd ~ 2, in 
line with the above estimate. Note that the original cr e // in 
the held contributes negligibly to the total noise variance 
in these tests, so imprecise knowledge of its value is irrele¬ 
vant. Of course, as the noise variance increases, the detailed 
structure of the PDF, including any deviations from lognor- 
mality, is blurred out, and all we are able to extract are the 
parameters of the best-fitting lognormal. The effect of noise 
on the PDF in direct space is easily discerned: first, the PDF 
is broadened by the noise convolution, and second, the PDF 
peak is shifted upwards in Ay from its original peak (always 
below the lognormal’s mean) towards the lognormal’s mean. 

The broadening and shifting of the PDF in direct space 
have counterintuitive effects in the log representation. The 
shift of the direct space PDF in fact causes the log space 
PDF to become narrower (lower apparent a) near its peak, 
which is also shifted. To show this, we plot in Figure [5] the 
PDF fit to Taurus before noise convolution (i.e. an exact 
lognormal with parameters given in Table [l]) , compared to 
the PDF subjected to noise convolution, using cr e // = 0.21, 
0.5, and 1.0 magnitudes. At cr e // = 0.21 (the estimated noise 
level in the data), the log space PDF at In Ay just below the 
peak is suppressed , while the higher In Ay levels are affected 
much less. Thus the PDF is narrower around its peak. As 
(T e ff increases, the shift and distortion become greater, and 
the PDF narrowing around the peak is more emphasized 
(see also Fig. 5 of Schneider et al 2014). It is always possible 
to fit a parabola close to a smooth local peak, so, without 
taking into account the noise at the outset, the PDF fits to 
the core of the PDF will always be narrower (lower apparent 
a) than the true PDF, and spurious tails will be created or 
any existing tails will be amplified. 

5.2 Effects of foregound/background 
contributions 

The presence of an unaccounted for foreground/background 
contribution can have significant influence on the PDF in 
the log representation (Schneider et al 2014). In Figure [6] 
we show the (direct space) fitted PDF to Taurus, includ¬ 
ing the noise PDF convolution, subjected to small offsets 
of pifb — 0.0, 0.2, and 0.4 magnitudes, and transferred to 
the log representation. As noted above, fitting to the core, 
even for /ifb = 0.0 mag., leads to spurious excess in the 
tail, but this effect is amplified in the presence of fifb > 0. 
While the direct space effect is just a linear shift along the 
Ay axis, the log space effect is to shift and distort the PDF 
(i.e. similar to the effect of noise shifting the PDF peak). 
Fits made to the core as /ifb increases systematically pro¬ 
duce greater excess in the tail. It is therefore important to 
account for possible foreground/background contamination, 
especially for more distant clouds, as the contaminating ex¬ 
tinction will grow with distance. Extinction maps made us¬ 
ing reference fields will alleviate this problem, but column 
density maps made from far infrared emission will be sen- 
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Figure 7. Wide and narrow field selections for Taurus, Perseus, Orion B, and Orion A (clockwise from top left). The arrows denote 
the direction in which the fields are restricted from wide to narrow. Black contours are at Wco = 1 K km s -1 and 10 K km s -1 . 
Pixels determined to be X-dominated (see text) are coloured blue, with “diffuse” pixels coloured green. The X-dominated boundary 
roughly coincides with the Wco = 1 K km s -1 contour. The “cold” molecular part of the extinction is taken to lie within the Wco — 
10 K km s —1 contour. 


sitive to the entire dust column and therefore be more sus¬ 
ceptible to this bias. Schneider et al (2014), using Herschel 
imaging, indeed find foreground/background contributions 
of between 0.8 and 3.0 magnitudes, albeit for more distant 
clouds than analyzed here (0.45 - 7 kpc). 


6 FIELD SELECTION 

If the signal-to-noise level is high, the leading uncertainties 
on the field’s mean and variance estimates come from the 
field selection itself: i.e. exactly how the field is chosen (this 
can be somewhat arbitrary) effectively sets the mean and 
variance. The other important source of uncertainty (once 
the mean and variance are estimated) is the applicability of 
the lognormal model. Above, we have used this to predict 
/xa v and ctav? but these will differ from the predictions if the 
field is not lognormal. As the lognormal model appears to 
be well-motivated, this is a good first approximation, follow¬ 
ing which any deviations can be accounted for if necessary 
(e.g. the addition of a power-law component, including any 
modifications to the an d <j\y predictions). 

We note that while a lognormal can appear linear in the 
log representation over limited ranges, it cannot account for 


positive log-PDF curvature, or account for true power-law 
behaviour indefinitely. The PDFs shown in the previous Sec¬ 
tion in fact only displayed minor “core+tail” structure; i.e. 
the PDFs do not show a very distinct kink at any Ay level. 
There are some observed PDFs (e.g. Kainulainen et al 2009; 
Alves de Oliveira 2014; Schneider et al 2014) which display a 
more obvious core+tail behaviour, involving a more distinct 
high Ay departure from the low Ay behaviour. The high Ay 
departure from the low Ay lognormal core is not always an 
obvious power-law however, but often has the appearance of 
the high- Ay end of a broad lognormal function. 

6.1 Effects of field selection 

An extinction map does not distinguish between molecular 
and atomic and gas, while both will in general be present 
in the map. Perhaps a more sensible distinction to make 
is between cold (^isothermal) gas and warm(er) gas. This 
latter component we will refer to as “diffuse” and note that 
it could include both atomic and warm molecular material 
(e.g. CO-dark molecular gas; e.g. Langer et al 2014). 

The low Ay structure of observed PDFs will be con¬ 
trolled by the amount of diffuse material incorporated into 
the field selection. For different field selections, the amount 
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Figure 8. X-dominated fraction, /mol,x versus spatial scale for 
the field selections in Taurus (T), Perseus (P), Orion A (A), and 
Orion B (B). 


Figure 9. Measured y 2 f° r a single lognormal function that 
matches the mean and variance of Ay for each field selection, 
plotted versus the X-dominated fraction, Jmol,x- 


of diffuse material included will vary, and therefore the am¬ 
plitude of the “core” will vary. To demonstrate this, we have 
taken a more general approach to field selection, as shown 
in Figure [ 7 ] Here we show the Taurus and Perseus molecular 
clouds circumscribed by a “wide” field selection and a “nar¬ 
row” field selection. We have also included Orion A and B 
from Rowles &; Froebrich (2009) in this analysis, since these 
clouds display a more distinct “core+tail” PDF structure in 
Kainulainen et al 2009. 

Between the wide and narrow field selections, we im¬ 
pose a series of intermediate selections, with the direction of 
contraction of the field selection schematically represented 
by the arrows in Figure [ 7 ] For each selection, the mean and 
noise-corrected variance are calculated, a matching lognor¬ 
mal PDF is created and convolved with the noise PDF to 
produce a test PDF, ftest , which is then compared to the 
observed PDF, f obs - We have opted for a simple rectangu¬ 
lar field selection, to avoid the obvious problems that would 
arise in employing a cut in Ay to define the cloud. 

If the true PDF is not a pure lognormal, ftest will not be 
a good representation of fobs- As a goodness of fit statistic, 
we use the average of [In (f Q bs,i/ftest,i )] 2 N obs ,i (= reduced 
chi-squared, y 2 ) calculated over all PDF bins i where the 
observed PDF count, N 0 b s ,i is greater than 50 (note that the 
expected error on In (f obs ,i/ftest,i) is 1 /yj N obs ,i )• Field selec¬ 
tions which produce a distinct core+tail PDF will therefore 
have a higher y 2 than those that produce a pure lognormal 
PDF. In the tests below, we do not include accounting for 
foreground/background contamination (initially). 

To trace the amount of cold (molecular) versus dif¬ 
fuse material incorporated into each field selection, we use 
CO 1-0 data from Dame, Hartmann, & Thaddeus (2001; 
DHT01) to infer the contribution to the extinction aris¬ 
ing from molecular gas traced by CO. We take an X-fact or 
of 1.8xl0 20 cm -2 (K km s -1 ) -1 (DHT01) and a conver¬ 
sion between molecular hydrogen column density and Ay of 



Figure 10. Measured y 2 f° r a single lognormal function that 
matches the mean and variance of Ay for each field selection, 
plotted versus fw 10 — the fraction of CO J=l-0 intensity con¬ 
tributed by pixels exceeding Wco = 10 K km s -1 . 

2Nh 2 — 1.8xl0 21 cm -2 mag. -1 (Bohlin, Savage, & Drake 
1978). This results in: 

Av,mol,x = 0 2 Wco ) (16) 

mag. Kkms 

where Wco is the integrated intensity of the CO 1-0 line 
and Av ; mol,x is the inferred contribution to the extinction 
from molecular gas traced linearly by Wco- 

We convolved the Rowles & Froebrich Ay map to the 
same resolution as the DHT01 CO map (7.5 arcmin), and 
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resampled both maps onto the Rowles & Froebrich grid 
(2 arcmin). We designated a pixel as “X-dominated” if 
Av,mol,x/Av ^ 0.5. (The label X-dominated here refers 
to the dominance of material traced linearly by CO via the 
X-factor, and does not include CO-dark H 2 which is proba¬ 
bly warmer and more diffuse. Note also that CO emission is 
used only to define a lower threshold and we do not rely on 
Woo to accurately predict a corresponding Ay in the high 
column density limit where CO may saturate.) 

For each field selection, we determined the fraction of 
pixels that are X-dominated, and called this /mol,x- For 
reference, Figure [8] shows the X-dominated fraction versus 
spatial scale (in pc) for each cloud. The spatial scales are 
defined as the square root of the area of the field selec¬ 
tion, using distances of 140 pc (Taurus; Elias 1978), 300 pc 
(Perseus; Bally et al 2008), and 390 pc (Orion A and B; 
Mayne &; Naylor 2008). 

In Figure |9] we plot the measured x 2 (quantifying the 
success of a single lognormal in representing the observed 
PDF) against the measure X-dominated fraction, /mol,x, 
for each field selection. As the field selection is restricted 
from wide to narrow, focusing increasingly on X-dominated 
extinction structure, a single lognormal function becomes 
systematically a better fit to the observed PDF. For some 
fields (Perseus and Orion B, and Orion A to a lesser extent) 
the observed % 2 reaches a plateau before the narrowest field 
selection is approached - i.e. the PDF becomes essentially 
lognormal and remains lognormal for restricted sub-samples, 
perhaps hinting at self-similar behaviour. For reference, the 
original field selections for Taurus and Perseus in Sections 3 
and 4, were at 25 pc (/mol.x ~ 0.8) and 41 pc (/mol,x ~ 
0.5) respectively. 

There does not appear to be a definitive value of 
f mol,x at which the minimum in x 2 is reached - a range 
of 0.6 < fMOL,x < 1 is seen - but this will depend on 
the structure of the extinction in the immediate vicinity of 
the main X-dominated region for each cloud. The surround¬ 
ings of Taurus, for example, are relatively molecule-rich, by 
the above definition, compared to the other fields. A mass- 
weighted X-dominated fraction can be calculated, and this 
suggests that the PDFs transition to pure lognormal form 
when the mass incorporated into the PDF is 80-100% X- 
dominated, by the above definition. 


To focus more explicitly on the cold molecular compo¬ 
nent, we have also defined a fraction, fw 10 , of sight lines 
which have Wco ^ 10 K km s _1 . This will more effec¬ 
tively select the coldest ^isothermal gas, while ignoring the 
warmer molecular cloud envelopes (e.g. Pineda et al 2010). 
A plot of x 2 versus fw 10 is shown in Figure 
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There is a 

little more coherence to the trend-lines in this Figure (versus 
Figure |9j, indicating that it is in the coldest molecular gas 
that the PDF tends to a pure lognormal form. A fraction 
fw 10 > 0.8 marks a reasonable transition point between 
“good” and “poor” fits of a single lognormal. The rate at 
which the fits deteriorate as the cold molecular zone is ex¬ 
ceeded (fw 10 falls) is variable amongst the clouds however. 
For Orion A and B, a single lognormal quickly becomes a 
very poor fit to the PDF once fw 10 drops below ~0.8, while 
for Taurus and Perseus the deterioration is more gradual. 
We will return to this point below. 


6.2 Appearance of the PDFs 


In Figure [Tl] we show PDFs measured in the molecule- 
dominated regions for each cloud, compared with the PDF 
measured in the widest field selection. For the molecule- 
dominated PDFs, we have used the largest field selection 
at which x 2 appears to plateau, and now accounted for fore¬ 
ground/background contamination. For Orion B, we find 
that 11 fb — 0.14 mag. provides the best fit (which is higher 
than that in the more nearby Perseus and Taurus, for which 
the previous estimates in Table 1 are good). The extinc¬ 
tion structure around Orion A is more complicated: fur¬ 
ther from the Galactic plane, the extinctions adjacent to 
the Orion A X-dominated region are low (~ 0.2 mag.) while 
closer to the Galactic plane, the extinctions are notably 
higher (~ 1 mag.). The uncertainty in the baseline extinc¬ 
tion level can be seen in the Orion A PDF at low Ay: the 
model lognormal PDF is not a good representation of this 
bimodal extinction distribution, leading to relatively higher 
X 2 for Orion A than seen in the other fields. We find that 
foreground/background levels of ~0-0.2 mag. are effectively 
equivalent in terms of x 2 for Orion A, but a more sophisti¬ 
cated baseline level is needed here. 

Figure [Tl] shows that the PDF counts are essentially 
unmodified by the field restriction above a level of Ay ~ 2 
(perhaps a little higher for Taurus). The amplitude of the 
low Ay peak for intermediate field selections lies between 
the two extremes shown in Figure HH It is instructive to 
inspect all the log-space PDFs obtained from the various 
scale-dependent field selections, and these are shown in Fig¬ 
ure [12] Red and blue lines show the direct-space lognormal 
fits translated into the log representation for the two small¬ 
est, molecule-rich, field selections. (As noted above, Orion A 
has more complicated structure in the surrounding extinc¬ 
tion, and its PDFs are not accounted for very well at low 


Ay.) It is evident from Figure 12 that there are two types 
of general PDF structure. The PDFs of Taurus and Perseus 
show more gradual deviation from lognormal behaviour as 
the field size is increased to include warmer, diffuse mate¬ 
rial. In particular, the curvature of the log-log PDFs remains 
negative (concave downwards). In contrast, Orion A and B 
have PDFs with a distinct inflection at ln(Ay) ~ 0.8-1. This 
can be compared to Figure [lO] where the PDFs for Orion A 
and B quickly become non-lognormal once the cold molecu¬ 
lar zone is exceeded by the field selection. 

We also extracted PDFs in a number of rectangular 
windows near the molecule-dominant regions to sample “dif¬ 
fuse” PDFs. These are reasonably-well fitted by lognormals 
of low dispersion (a ~ 0.04-0.45) with the higher dispersions 
primarily driven by very low mean Ay. 


7 DISCUSSION - THE CORE/TAIL DUALITY 

In general, the wide-field PDFs display core+tail structure. 
The amplitude of the “core”, and therefore of any inferred 
deviations from it, is directly dependent on the rather ar¬ 
bitrary amount of diffuse material included in the field se¬ 
lection and therefore incorporated into the PDF. In many 
cases, the field selection is simply set by the availability of 
data - i.e. determined by the limits over which the cloud is 
mapped. 
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Figure 11. Fitted lognormals (blue lines) to the molecule-dominated PDFs (heavy black lines) for Taurus, Perseus, Orion A, and 
Orion B. Also shown are the wide field PDFs (lighter black lines). 


For the molecule-dominated regions, especially the cold¬ 
est regions, the PDF is well-described by a pure lognormal, 
which is the theoretically expected form for isothermal tur¬ 
bulence. It is likely that the molecule-dominated regions (at 
least as defined by our CO-based criterion - which trace the 
nominal “molecular clouds” in each case) can be reasonably- 
well described as isothermal. This means that, at the resolu¬ 
tions probed by the Rowles & Froebrich data (a few tenths of 
a pc), the core/tail duality is a duality between warm diffuse 
material and cold molecular material, rather than a duality 
between nominal “lognormal core” behaviour and gravity- 
induced power-law behaviour associated with star forma¬ 
tion. That is: the “tail” part of the wide-field PDF is the 
high extinction part of a lognormal PDF that arises collec¬ 
tively from the main body of the cold molecular cloud. The 
distinction between these two scenarios is shown schemati¬ 
cally in Figure [13] though elements of both may play a role 
in general. 

One could ask whether, for an arbitrary field selection, 
a combination of lognormal core + power-law tail could pro¬ 
vide a better description of the data. But, by Ockham’s Ra¬ 
zor, there is no need for a power-law contribution, as the ob¬ 
served x 2 values are ^unity for a lognormal PDF in the cold, 
molecule-dominant regions (excepting Orion A for reasons 
that are understood), once a systematic approach to field se¬ 


lection is employed. This is not to say, of course, that gravity 
definitively plays no role in the molecular cloud structure. 

From the evidence of the PDF alone, it is not possi¬ 
ble to distinguish between the turbulence/gravity scenario 
or the warm/cold scenario, though in the above we have 
demonstrated that PDFs sampled from the cold, molecu¬ 
lar region are lognormal, and we believe that this favours 
the warm/cold picture, with no role for gravity strictly nec¬ 
essary. Kainulainen et al (2011) have also suggested that 
the formation of pressure-bound clumps may precede any 
gravitationally-driven structure. In a multi-phase turbulent 
picture, the cold gas will likely have a higher Mach number 
due to its low temperature (e.g. at fixed velocity disper¬ 
sion), and this would drive a greater dispersion in density 
and therefore column density (the molecular PDFs would 
have higher log-dispersion than the diffuse PDFs). It is also 
known that PDFs generated by the thermal instability will 
usually be bi-modal, or at least non-lognormal in a similar 
way to those observed above (e.g. Audit & Hennebelle 2010; 
Gazol & Kim 2013; Kim, Ostriker & Kim 2013; Saury et al 
2014; Heiner et al 2014, arXiv 1403.6417 submitted), in part 
presumably because of the increase in Mach number as the 
gas cools. 

However, since the PDF is a relatively crude measure 
of the key underlying physics (Tassis et al 2010), this does 
not rule out a role for gravity in the creation of the molecu- 
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Figure 12. Fitted lognormals (blue and red lines) to the PDFs obtained from the two smallest, molecule-rich, field selections (heavy 
black lines) for Taurus, Perseus, Orion A, and Orion B. Also shown are the wider field PDFs (lighter black lines) bracketed by the largest 
field selection again shown with a heavier black line. 




Figure 13. Schematic depiction of the core/tail duality for the isothermal turbulence + gravity picture (left) and the multi-phase picture 
(right). 
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Figure 15. PDFs and their dispersions as a function of resolution for Taurus. 
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Figure 14. Normalised extinction variance, cr \ v /l J '\ v ^ versus 
spatial scale for the field selections in Taurus (T), Perseus (P), 
Orion A (A), and Orion B (B). 


lar cloud initially (e.g. Ballesteros-Paredes et al 2011). Nei¬ 
ther does it rule out the possible detection of deviations 
from (cold, molecular) lognormality that might be seen at 
higher resolution. Indeed, for the highest spatial resolution 
field (Taurus) there is a slight excess in the PDF, above the 
nominal lognormal behaviour, near Ay ~ 6 — 8 mag. (though 
there does not appear to be a corresponding distinct excess 
in that Ay range in the Taurus PDF measured at slightly 
higher resolution by Kainulainen et al 2009.) Power-law ex¬ 
cesses beginning at this Ay level have in fact been suggested 
by Froebrich & Rowles (2010) and Schneider et al (2014). 

If it is the transition between large-scale/warm/diffuse 
material to smaller-scale/cold/molecular material that 
drives the change in PDF structure, then the abruptness of 
the transition may give clues on the formation mechanism of 
the cloud. The transitions in Orion A/B are distinctly more 
abrupt than those in Taurus/Perseus, as evidenced by the 
cur vat ure/inflect ions of the PDFs in log-space and by the de¬ 
terioration of the single lognormal fit when the field selection 
exceeds the cold molecular zone. However, for anisotropic 
clouds, the viewing angle is important (Ballesteros-Paredes 
et al 2011). 


8 SCALE- AND RESOLUTION-DEPENDENCE 

OF THE COLUMN DENSITY VARIANCE 

It is not our intention here to measure the dispersions of 
the PDFs, though it is worth a brief comment on the diffi¬ 
culties of this task. We gave measurements of dispersions in 
Table 1 solely for the purpose of comparing two methods of 
measurement. In general, the measured dispersion will de¬ 
pend on the field selection and the resolution at which the 
dispersion is measured. We will briefly explore these issues 
below. 

The lognormal cr values for the Orion A and Orion B 
molecule-dominated regions (for the PDFs in Figure |TT] ) are 
0.53 and 0.71 respectively. While the variance in Ay is higher 
in Orion A, the mean Ay is also higher by an amount that 
results in a lower a than in Orion B - see equation ©• 
Both the Orion a values are notably lower than a listed 
for Perseus and Taurus in Table 1. This is in part due to 
a resolution dependence, as discussed below, but the mea¬ 
sured lognormal a in fact varies significantly, depending on 
the field selection. As the field is narrowed more and more, 
the mean Ay rises, and although the variance also rises, the 
overall effect is typically to lower the measured a. This oc¬ 
curs in ah the fields analyzed above. To quantify this, we 
show, in Figure |14| the variation of normalized extinction 
variance (related to a via equation <©) as a function of the 
scale at which it is measured. There is no convergence as 
the molecular zone is breached at small scales. For the same 
field shape and centre, the measured a is generally larger for 
larger field selections - though at some point a single lognor¬ 
mal will cease to represent the PDF. We suggest that a field 
selection that maximises a subject to simultaneously lying 
in the plateau of % 2 may provide an objective definition for 
a applicable to the cold, molecular zone. 

The second issue is that of the resolution-dependence 
of cr. Obviously, at higher resolution the field variance must 
increase while the mean remains constant. The Taurus field 
is large enough to perform quantitative tests of this, so we 
have degraded the resolution of this field to 8 arcmin and 
16 arcmin from the original 4 arcmin. Instead, this can be 
viewed as moving the cloud to distances of 280 pc and 560 pc 
respectively. The resulting PDFs and measured a are shown 
in Figure pA] Interestingly, the PDF remains essentially log¬ 
normal as the resolution is changed, indicating some form of 
self-similar behaviour. The measured a falls as the resolu¬ 
tion is degraded. Given the spectral slopes of column density 
power spectra in molecular clouds, the dispersion measured 
in 2D should converge as the resolution is increased, and it 
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may be possible to infer the value at convergence. To es¬ 
timate the corresponding 3D dispersion in density is more 
difficult, as that is very slowly convergent, if at all (see dis¬ 
cussion in Brunt 2010). 


9 SUMMARY 

We have shown that extinction PDFs of a sample of molec¬ 
ular clouds obtained at a few tenths of a parsec resolution, 
probing extinctions up to Ay ~ 10 magnitudes on scales 
of a few tenths of a parsec, are very well described by log¬ 
normal functions provided that the field selection is tightly 
constrained to the cold, molecular zone and that noise and 
foreground contamination are appropriately accounted for. 
In general, field selections that incorporate warm, diffuse 
material in addition to the cold, molecular material will dis¬ 
play apparent core+tail PDFs. The apparent tail, however, 
is best understood as the high extinction part of a lognormal 
PDF arising from the cold, molecular part of the cloud. 

For a general field selection that includes warm and cold 
gas, the PDF shows one of two forms. Either the transition 
to the molecular zone is abrupt, resulting in an inflection in 
the log-log PDF structure, or the transition is more gradual 
and the log-log PDF retains negative (concave down) cur¬ 
vature at all Ay. The former of these is also traced by an 
abrupt deterioration in y 2 when a single lognormal is used 
to fit PDFs that include progressively more diffuse material, 
while the latter shows a more gradual deterioration in y 2 - 
These features may provide clues on the formation mecha¬ 
nism of the molecular cloud, but could also be influenced by 
the viewing angle. 

In terms of fitting the PDF, convolution of the model 
PDF with the noise PDF is essential. With noise convolu¬ 
tion, the fitted lognormal PDFs account for both the high 
Ay behaviour of the PDF and the noise-dominated low Ay 
behaviour in the analyzed fields. The method used here also 
has the advantage of directly accounting for both the mean 
and variance of the column density, which is desirable. Fit¬ 
ting alternative models with predictable mean and variance, 
or use of higher order moments beyond mean and variance, 
can be simply done. 
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